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Balanced truncation, a technique from robust control theory, is a systematic method for pro- 
ducing simple approximate models of complex linear systems. This technique may have significant 
applications in physics, particularly in the study of large classical and quantum systems. These 
notes summarize the concepts and results necessary to apply balanced truncation. 
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I. INTRODUCTION 

Theoretical physics endeavors to produce models that 
predict observed physical phenomena. Tl·iough some- 
times the challenge is to develop a mathematical lan- 
guage describing the system of interest, often (especially 
in the study of complex systems) one can write down 
the exact dynamics and gain little insight — the result- 
ing expressions are too cumbersome, too messy, or too 
ill-conditioned to be uscful. By exploiting symmetries 
and other characteristics of the particular system, one 
may find a simpler equivalent description of the dynam- 
ics. This description may be further simplified by us- 
ing approximation techniques, e.g., asymptotic limits and 
small-parameter expansions. Much of the art of the field 
lies in finding and choosing ad hoc mcthods for deriving 
these simpler models; however, more systematic methods 
are clearly desirable. 

Control theorists have developed a variety of model 
reduction techniques that systematically produce simple 
models of complex systems. These notes will describe 
balanced truncation jï], |^j, a model reduction technique 
for linear systems which is readily available in a vari- 
ety of formats (e.g. MATLAB). Balanced truncation has 
recently been applied in physics contexts ||, Q], and the 
results suggest it will prové a useful tool for treating large 
systems in both classical and quantum settings. 

Much of this discussion comes directly from Dullerud 
and Paganini Q . We will present the important concepts 
and results necessary to apply balanced truncation, omit- 
ting both the proofs and the algorithms. We refer the 
reader to [Q and Q for a more complete mathematical 
discussion, and to MATLAB toolboxes and their docu- 
mentation Q for the computational methods. 

In section II we will describe the input-output 
paradigm of control theory, and introduce state-space 
models, the class of systems treatable by balanced trun- 
cation. Given an arbitrary state-space model, we will 
characterize the smallest state-space model with identical 
input-output characteristics in section III, and in section 
[V we will show how balanced truncation is used to find 
smaller models with controlled approximation errors. 




FIG. 1: An example input-output system. 
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II. INPUT-OUTPUT MAPS AND STATE-SPACE 
MODELS 

In many physics settings one is more concerned with 
the macroscopic behavior of a large system, and less con- 
cerned with the system's microscopic details. As an ad- 
mittedly contrived example, consider a pendulum in a 
plane at whose free end is a tank partially fillcd with a 
classical fluid (see Fig. [!]). Suppose that at time t = 
the system is at rest in its stable equilibrium, and our 
only method of disturbing the system is to exert a time- 
varying torque r(í) at the pivot. Suppose further that 
we are concerned only with the time evolution of pendu- 
lum's angle 8(t) — not with the distribution of the fluid, 
given by some high-dimcnsional variable $(£). 

An exact model including the full fluid state $ would 
give the system's exact response to a driving torque, but 
would be quite impractical. As we only wish to describe 
the angular response to the driving torque (a mapping 
from one degree of freedom to another) one suspects a 
lower-dimensional model might suffice. For example, one 
might try treating the fluid as a point mass attached to 
the pendulum by a non-lincar spring. 

Systems of this sort are naturally phrased in a control 
theory language. In the typical control scenario, a time- 
varying input u{t) drives a system with state x(t) giving 
output signal y(t), and the system dynamics have the 
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FIG. 2: A block diagram. 



form 



x(t) 

y{t) 



f(x(t),u(t)) 
h(x(t),u(t)). 



(1) 



Such systems are depicted by a block diagram as shown 
in Fig. ^. In the example of the pendulum, the system's 
state is given by x — (9, 9, <£>, <£>) so as to describe the 
evolution with the first-order dynamics (|ï|). The input 
to the system is u(t) — r(í), and the system's output is 
y(t) = 9{t). 

Together with some initial condition (typically a;(0) = 
0), the functions / and h in (Q) define an input-output 
map "F taking u to y. Since it is this relation with which 
we are concerned, rather than the system's internal dy- 
namics, a theoretical model for the system will suffice if it 
describes <F . Given some system of the form (Q), model 
reduction aims to produce simpler models (i.e. models 
with a lower-dimensional state x) that approximate the 
original input-output map. 

We will consider models of the form 



x = Ax + Bu 
y = Cx + Du 



(2) 



where x e K", u e E m , and y e W, and A, B, C and D 
are time-independent real matrices of sizes n x n, n x to, 
p x n and p x m respectively. (This entire discussion also 
holds for complex-valued systems.) Such linear models 
are called state-space models, and the order of a model 
is n, the dimension of the state x. For compactness, the 
model with matrices A, B, C and D is denoted by 



(3) 



(This notation should not be confused with the (n+p) x 
(n + to) matrix with blocks A, B, C and D.) Often the 
output will only depend on the system state, i.e. D = 0. 

Consider the dynamics (||) under a change-of-basis z = 
Tx, where T is invertible but need not be unitary. The 
dynamics may then be written as 




i = TAT- 1 z + TBu 
y = CT^z + Du. 



(4) 



Thus changing the basis of the state space defines a map- 
ping on state-space models given by 



(5) 




TAT- 1 


TB 


CT- 1 


D 




n" < rí 




y(t) w y(t) 



FIG. 3: Given an order n model, we íirst reduce the system 
to the lowest order n' < n with the same input-output map, 
and then reduce to an approximate model of order n" < n' . 



(Note that D is unchanged.) We will call such maps 
similarity transformations. Because these transforma- 
tions are merely a rewriting of the system dynamics, the 
input-output map remains the same. 

As a given state-space model is not a unique descrip- 
tion of an input-output map i', in the next section we ask: 
what is the lowest-order model with the same $ as the 
given model? After finding a lowest-order exact model, 
we will use balanced truncation to find lower-order mod- 
els approximating , F. This procedure is summarized in 
Fig. | 



III. LOWEST-ORDER EXACT MODELS 

To hnd state-space models of lowest order with the 
exact \Jj of a given model, we will split the problem into 
two parts: controllability and observability. We will then 
combine these ideas to find minimal realizations of the 
system. 



A. Controllability 

Assume the system is initially in the state x(0) = 0; 
given a time r > 0, the controllable states Xf are those 
for which there is an input signal u(t) yielding x(t) = Xf. 
The dynamics (j|) can be integrated to yield 



x{t) 



,A(T-t) 



which gives a linear map í'i 1 "' : u — > x(t). The control- 
lable states form the image of this map. Since the map 
is linear, the image is a subspace of the state-space E™, 
called the controllable subspace. We denote this subspace 



Bu(t)dt, 



(6) 
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by Cab, as controllability only depends on the matrices 
A and B. 

Using properties of the matrix exponential it can be 
shown that the controllable subspace is the image of the 
controllability matrix: 



C AB = Image [B AB A B . . . A n ~ B] 



(7) 



Thus we see that Cab is independent of r. It can also 
be shown that the controllable subspace is the image of 
the controllability gramian, an n x n matrix given by 



X G = 



BB^e AU dt, 



(8) 



and the orthogonal subspace of uncontrollable states is 
given by the controllability gramian's kernel. If Cab = 
R" (i.e. there exists an input signal u to prepare any 
state x{t)) then we say that (A, B) is a controllable pair. 

It can be shown that given any A and B, we can find a 
similarity transformation such that the transformed ma- 
trices have the block structure 



A = TAT- 



A n A 12 
A 22 



B = TB 



Bi 




(9) 



with (Au, Bi) a controllable pair. Writing the state vec- 
tor as x = (xi, x 2 ) corresponding to this block structure, 
we have 



±i = Auxi 
x 2 = 



A x2 x 2 
A 22 x 2 . 



Biu 



(10) 



Because x(0) = 0, these dynamics yield x 2 (t) = for all 
time. Thus the dynamics for xi reduce to 



Xi 



AnXi + Biu. 



(11) 



Because (An,Bi) is a controllable pair, we may choose 
an input u to prepare any state xi (r) , and thus the trans- 
formed controllable subspace C^jj is given by the states 
of the form (xi,0). The orthogonal subspace, given by 
states of the form (0, x 2 ), is irrelevant to the input-output 
map since no input can affect these states. 



B. Observability 

We now consider another problem with a similar struc- 
ture. Suppose the system is in some initial state x(0) — 
xq and u = for all time. Based on the output y(t) for 
< t < t, can we uniquely identify xq? Integrating the 
dynamics (|J) yields 

y(t)=Ce At x Q , (12) 

which gives a linear map : xq —> y (where by y we 
mean the output signal y(t) for < t < r). Suppose 

two initial states Xq and xi give the same y. As *1 T) 



is linear, the initial state xo — Xi must give y = 0. We 
call initial states giving output y = unobservable, since 
any unobservable state may be added to any other initial 
state without changing the output. 

(r) 

The unobservable states form the kernel of $o ; as 
the map is linear these states form a subspace, called 
the unobservable subspace and denoted by Mca- It can 
be shown that the unobservable subspace is given by the 
kernel of the observability matrix: 



Nca = ker 



C 

CA 

CA n ~ 



(13) 



Thus Nca is independent of r. It can also be shown that 
ÀÍca is given by the kernel of the observability gramian 



Yn = 



C J{ Ce At dt, 



(14) 



and the orthogonal subspace of observable states is given 
by the observability gramian's image. If Mca = {0}> 
then the entire space is observable, and we say that (C, A) 
is an observable pair. As the observable states are given 
by the image of (O) and the controllable states are given 
by the image of (ra), (C,A) is an observable pair if and 
only if (A*, C>) is a controllable pair. 

It can be shown that given any C and A, we can find a 
similarity transformation such that the transformed ma- 
trices have the block structure 



A = TAT- 



C 



CT~ 



An 
A 2 i A 22 

Ci 



(15) 



with (Ci, Au) an observable pair. Writing the state vec- 
tor as x = (xi, x 2 ) corresponding to this block structure, 
we have 



±i = Anxi 
x 2 = A 2í xi + A 22 x 2 
y = Cixi. 



(16) 



Thus the time evolution of xi is never affected by x 2 , 
and the output signal y only depends on xi. Because 
(Ci, Au) is an observable pair, we can uniquely identify 
an initial state x(0) = (ïi,0) based on y. The trans- 
formed unobservable subspace A/g^ is given by the states 
of the form (0,2:2), and is irrelevant to the input-output 
map since no output can be affected by these states. 



C. Minimal Realizations 

The notions of controllability and observability give us 
a means of deciding whether a state affects the system's 
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input-output map: if a state is unobservable, it does not 
affect the output, and if a state is uncontrollablc, it is 
unaffected by the input. Only those states that are both 
controllable and observable are of relevance. 

We say that a state-space model given by matrices 
(A, B,C, D) is a minimal realization if no lower-order 
model gives the same input-output map í'. The intuition 
above can be made precise as follows: it can be shown 
that a model is a minimal realization if and only if all 
states are both controllable and observable, i.e. (A 1 B) is 
a controllable pair and (C, A) is an observable pair. 

Given a state-space model given by A, B, C and D we 
may find a minimal realization by isolating only those di- 
mensions which are both controllable and observable. To 
do so, we perform a Kalman decompositioTi, which simul- 
taneously performs the transformations (^|) and (^) as 
follows. For any state-space model there exists a similar- 
ity transformation such that the matrices of transformed 
model have the block structure 



/A u A 



TAT- 1 


TB 


CT- 1 


D 



13 







A 2 i 


A 22 


-423 


A 2i 


B 2 






















-443 


Au 





V c 1 





C 3 





D ) 



Au 


B x 


Ci 


D 



(17) 

and, writing the state vector as x = (x±, Xi, x%, X4) cor- 
responding to the block structure, the controllable states 
are of the form (xi,x 2 ,0, 0) and the observable states are 
of the form (xi, 0, 2:3, 0). Thus only states of the form 
[x\, 0, 0, 0) are both controllable and observable. 

Eliminating all but the states (xi, 0,0,0) yields the 
state-space model 



(18) 



Since we have only eliminated states irrelevant to the 
input-output map, this reduced system has the exact 
same \E r as the original system. Further, since states of 
the form (x\, 0,0,0) were both controllable and observ- 
able, (Au,È\) is a controllable pair and {C\,Au) is an 
observable pair; thus this model is a minimal realization. 

Somctimes the uncontrollable and unobservable states 
are obvious from the form of a physical system's dynam- 
ics, e.g. some degrees of freedom are uncoupled, or some 
symmetry can be exploited. In other circumstances, the 
physics may not make it clear which states can be elim- 
inated without affecting the input-output map. Espe- 
cially in the latter situation, an algorithmic method such 
as the Kalman decomposition (available in MATLAB) 
can be quite advantageous. Noncthclcss, onc fccls intu- 
itively that one should always be able to find a minimal 
realization analytically if one is sufficiently clever. How- 
ever, we do not expect in general that Standard analytic 
methods will be useful in seeking lower-order approxima- 
tions, as we will do in the next section. 



IV. LOWER-ORDER APPROXIMATE MODELS 

To apply balanced truncation, we will first assume that 
we have reduced the model to a minimal realization. We 
make the further assumption that the resulting system is 
exponentially stable, i.e. all eigenvalues of A have strictly 
negative real part. (Various methods exist to extend 
these methods to unstable systems, e.g., jï|, but we as- 
sume stability to prové the Standard result.) 



A. Quantifying Observability and Controllability 

In the previous section we distinguished between states 
that were observable and unobservable; we now wish to 
quantify the observability of the observable states. To do 
so, consider the output signal for t > that results from 
the initial state x(0) = xq when there is no input u. This 
signal is given by ([12[). Using the £2 norm for the signal 
y, we have 



Ml 2 



f °°yl(t)y(t)dt 

4 (Jo°° 1 

x Y o x 



AH 



CCe At dt 



Xq 



(19) 



where Yo is the observability gramian given by (||) with 
t — > 00. Recali that the kernel of Yo is independent of 
t, so choosing r — > 00 will not change which states are 
observable and which states are unobservable. (System 
stability is required to ensure convergence of the integral 
in this limit.) Scaling the norm-squared of the output by 
the norm-squared of the initial state yields 



\y\\ 2 _ x\ Y oxo 



IN 



XqXq 



(20) 



which quantifies the observability of the states in the di- 
rection of xq. 

From (|ï^) we see that the observability gramian is Her- 
mitian and positive semidefinite. As we have a minimal 
realization, all states xq are observable (xqYoXq > 0), 
and so Yo is strictly positive definite. Thus in geomèt- 
ric terms analogous to the moment-of-inertia tensor, Yo 
defines an "observability ellipsoid" in the state space, 
with the longest principal axes along the most observ- 
able directions (see Fig. ||). A similarity transforma- 
tion given by T transforms the observability gramian by 
Y -> (T-^YoT' 1 . As T need not be unitary, this 
transformation may rescale the ellipsoid's axes as well as 
rotate them. 

We quantify controllability in a similar fashion. Sup- 
pose that at a time well in the past (r — * —00) the sys- 
tem is in the state x = 0, and some input u(t) drives 
the system for t < 0, yielding a final state x(0) = xq. 
As we have a minimal realization, all states xq are con- 
trollable and therefore can be prepared in this fash- 
ion. For each state xq there is a minimum signal size 



*opt| 



J_ u(t)i u(t)dt required to yield x(0) 



x . 
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The smaller this minimum signal, the more sensitive this 
state is to the input signal. Thus states with a smaller 



-^opt 



I 2 are said to be more controllablc. 



Consider the controllability gramian Xc given by (|8|) 
with r —>■ oo. Just as Yo, Xc is Hermitian and positivc 
semidefinite, and because we have assumed a minimal 
realization, Xc is strictly positive definite and therefore 
invertible. It can be shown that 



IKpt|| 2 = x\x c l x . 



It follows that 



*opt| 



ll^o 



xlXçXp 

t ' 

XqXq 



(21) 



(22) 



which quantifies the controllability of the states in the 
direction of x . Just as with Y Q , X c defines a "controlla- 
bility ellipsoid" in state space, with the longest principal 
axes along the most controllable directions. A similarity 
transformation given by T transforms the controllability 
gramian by X c -> TX c Tl 

We have thus found that the gramians give us a use- 
ful measure of a state's observability and controllability. 
One might ask whether the observability and controllabil- 
ity matrices of ( |Ï3| ) and (0) could serve a similar purpose, 
but in fact they are only useful for determining whether 
a given state is observable/controllable. One might also 
ask if r — > oo is necessary — we could choose finite r, 
and quantify controllability and observability on a finite 
time horizon. However, finite r does not lead to the error 
bound in the approximations of the next section, which 
is the main result of these notes. 



B. Balanced Truncation 

With the above quantification of observability and con- 
trollability, one might be tempted to prescribe some al- 
gorithm like eliminating the least observable or least con- 
trollable dimensions in the state space to yield a lower- 
order approximate model. However, such an approach 



FIG. 5: The balancing transformation T transforms the ob- 
servability and controllability ellipsoids to an identical ellip- 
soid aligned with principle axes along the coordinate axes. 



would not necessarily be successful. Supposc, for exam- 
ple, that the least observable states were in the direction 
of the unit vector x, but that states in this direction were 
extremely controllable. Thus a small signal u might lead 
to the infernal state x = Ai with A large. Though this 
state is the least observable, A might be sufnciently large 
that the resulting output signal is non-negligible. 

Instead, we wish to use observability and controllabil- 
ity to yield a single measure of a state's importance to 
the input-output map It can be shown that given two 
positive definite square matrices Xc and Yo of the same 
size, there exists an invertible T such that 



TX C T^ = (T-yY T" 



(23) 



where E is diagonal with positive real diagonal entrics. 
Such a similarity transformation is called a balancing 
transformation. Geometrically, balancing transforms the 
observability and controllability ellipsoids so that they 
are identical and their principal axes lie on the coordi- 
nate axes of the state space (see Fig. ||). 

The resulting S is unique up to permutation of the di- 
agonal elements, so we may choose T yielding the trans- 
formed gramians 



Xc = Yo = E = 



hi 



h„ 



(24) 



with hi > /12 > • • • > h n > 0. The hi are called Hankel 
Singular Vàlues (HSVs); in this transformed system hi 
is the quantitative measure of both the observability and 
controllability of the unit basis vector èj. Thus the ba- 
sis vectors have been sorted in order of relevance to the 
input-output map. 
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Once the system is balanced, we may truncate the 
state-space dimensions with low HSVs to yield lower- 
order approximate models. Intuitively, the smaller the 
HSVs corresponding to truncated dimensions, the better 
the approximation. We will now make this idea precise. 

Let the balanced state-space model with sorted HSVs 
be given by matrices A, B, C and D. Choosing some 
r such that h r is strictly greater than h r+ i, we write 
the state vector x — (xi,x?) where x\ gives the first r 
coordinates, and x 2 gives the last n — r coordinates. We 
then write the state-space model in the corresponding 
block structure 



(25) 




All 


A21 


Bi 


A 12 


A22 


B 2 


Ci 


c 2 


D 



and truncate the n — r least significant dimensions of the 
model, yielding the order r model 



(26) 



Au 


Bi 


c 2 


D 



This procedure is called balanced truncation. 

We now give bounds on the resulting approximation 
error. Let u(t) be some input signal on í e (—00,00) 
that is finite with respect to the £2 norm 



z(t)h(t)dt. 



(27) 



Let y(t) and y(t) be the resulting output signals of the 
original and truncated systems respectively. Stability of 
the original system ( p5[ ) and the strict inequality h r > 
h r+ i guarantees stability of the truncated system (26); 
since both systems are stable, the output signals y and 
y are also finite with respect to the norm ( |27| ) , as is the 
error signal y — y. Now, letting > > . . . > /i]L r be 
the distinct HSVs of the truncated n — r dimensions, it 
can be shown that 



< max 



\y - y\ 



(28) 



Thus we have both an upper and lower bound on the 
worst-case errors resulting from balanced truncation. 



Given the Hankel Singular Vàlues of a system, we may 
truncate to a system of desired order and bound the re- 
sulting error, or we may choose the order of truncation 
based on the maximum tolerable error. 

The advantages of balanced truncation as method of 
approximating the input-output map are two-fold. First, 
the error is bounded as above, in contrast to more tra- 
ditional approximations where errors estimated by the 
order of some small parameter, e.g. 0(e 2 ). Second, the 
algorithmic process is efHcient. Balanced truncation does 
not necessarily yield the optimal order-r approximation 
in the sense of the error bound (|2|), but fmding the op- 
timal approximation may be computationally difhcult. 

For single-input single-output systems (i.e. u, y S R), 
the error bound may also be described in terms of fre- 
queney responses. Given a sinusoidal input, a state-space 
model will give an output with the same frequeney and 
a frequeney-dependent amplitude and phase shift: 



u(t) = sin(wí) — > y(t) = A^ sin(wí + <p u ). 



(29) 



Let A u and <f) u be the amplitude and phase shifts for 
the exact system, and let A u and <p u be the shifts for the 
approximate system. Writing the exact and approximate 
shifts in complex form, we have 



hi < max 



A P il í>^ _ A f,i<l>« 



(30) 



Thus even though sinusoidal inputs and outputs are un- 
bounded in the £2 norm (27), the error is controlled in 
this fashion. 



V. CONCLUSION 

According to jjj, balanced realizations first appeared 
in the control literature in 1981, and the proof of the 
error bound on truncated models first appeared in 1984. 
Until rcccntly, howcver, the physics community has made 
little use of this powerful tool. We believe that balanced 
truncation will be of particular value when building sim- 
ulations of and theoretical models for the evolution of 
macroscopic quantities in large complex systems, and it 
is hoped that these notes will be hclpful in such studies. 
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